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Abstract — Adaptive inverse models find applications in 
communication and magnetic channel equalization, 
recovery of digital data and adaptive linearization of 
sensor characteristics. In presence of outliers in the 
training signal, the model accuracy is severely reduced. In 
this paper three robust inverse models are developed by 
recursively minimizing robust norms using BFO based 
learning rule. The performance of these models is assesses 
through simulation study and is compared with those 
obtained by standard squared norm based models. It is in 
general, observed that the Wilcoxon norm based model 
provides best performance. Moreover the squared error 
based model is observed to perform the worst. 

Index Terms — Adaptive inverse models, Robust norms, 
Robust adaptive inverse model, Bacterial foraging 
optimization 

I. Introduction 

In digital data communication systems, high speed data 
transmission over band limited channel often causes 
inter-symbol interference (ISI) due to adverse effect of 
dispersive channel [1]. The performance of linear 
channel equalizers is poor especially when the 
nonlinear distortion is severe [2]. In these cases, 
nonlinear equalizers are preferable [l]-[6]. Since 
artificial neural network (ANN) can perform complex 
mapping between its input and output space, different 
ANNs have been successfully used in nonlinear inverse 
modeling problem [l]-[6]. Functional link artificial 
neural network (FLANN) possess a low complexity 
structure and has been employed as an adaptive inverse 
model [7]-[8]. 

Recently, genetic algorithm (GA) has been used for 
nonlinear and blind channel equalization [9]-[10]. The 
operations in GA such as the crossover and the 
mutation, help to avoid local minima problem and thus 
provide improved solution. However there are some 
situations when the weights in GA are trapped to local 
minima. The bacterial foraging optimization (BFO) 
[11] like GA is a derivative free optimization technique 
and acts as an useful alternative to GA. The number of 
parameters that are used for searching the total solution 
space is higher in BFO compared to those in GA but on 
the other hand requires less number of computations. 



The possibility of BFO being trapped to local minimum 
is less. 

In recent years the BFO has been proposed and has 
been applied to many areas such as harmonic 
estimation of power system signals [12] and adaptive 
inverse modeling [13]. In case of derivative free 
algorithms conventionally the mean square error (MSE) 
is used as the fitness or cost function. Use of MSE as 
cost function leads to improper training of the 
parameters of adaptive model when outliers are present 
in the desired signal. The traditional regressors employ 
least square fit which minimizes the Euclidean norm of 
the error, while the robust estimator is based on a fit 
which minimizes another rank based norm called 
Wilcoxon norm [14]. It is reported that linear 
regressors developed using Wilcoxon norm are robust 
against outliers. Using this new norm robust machines 
have recently been proposed for approximation of 
nonlinear functions [15]. Inverse modeling finds many 
applications such as channel equalization in 
telecommunication [8], adaptive linearization in 
intelligent sensors [16] and digital control [17]. In all 
these applications the learning tools used are derivative 
based such as least mean square (LMS) or back 
propagation (BP) algorithms. When outliers are present 
in the training signals the inverse modeling 
performance of these algorithms degrades substantially. 
To the best of our belief no literature has dealt on the 
problem of development of robust inverse model in 
presence of outliers. Therefore our motivation in this 
paper is to address this problem and to provide 
effective solution using BFO based training of the 
models using a robust norms of error [15, 18, 19] as 
cost function. The choice of BFO as a training tool is 
due to its advantages over other evolutionary 
computing tool stated earlier. 

The paper is organized into following sections : 
Section II discusses inverse modeling problem. Robust 
adaptive inverse model is given in Section III. In 
Section IV the BFO based update algorithm is 
developed for the inverse model. Development of 
robust inverse model using BFO based training and 
robust norm minimization is dealt in Section V. For 
performance evaluation, the simulation study is carried 
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out and is dealt in Section VI. Finally conclusion of the 
paper is outlined in Section VII. 

II. ADAPTIVE INVERSE MODEL 




Fig. 1. Development of a robust inverse model using BFO based training 
cost function 



The input symbols are represented as x{k} at time 
instance, k . They are then passed to a channel (plant) 
which may be linear or nonlinear. An FIR filter is used 
to model a linear channel whose output at time instant 
k may be written as 



y( k ) = yVOMfr -0 



(i) 

where w(0 are the weight values and TV is the 
length of the FIR plant. The "NL" block represents the 
nonlinear distortion introduced in the channel and its 
output may be expressed as a nonlinear function of 
input and channel coefficients. 

z(k)=y/(x{k),x(k-l), ,x{k-N+\) 

w(0),w(l), ,w(N-\)), 

(2) 
where V^-) is some nonlinear function generated by 
the "NL" block. The channel output z(k} is 
corrupted with additive white Gaussian noise <?(£) of 
variance cr 2 . This corrupted received output is given 
by r (^0 . The received signal r (^0 is then passed 
into the adaptive inverse model to produce x(Js) 
which represents the input symbols x(k^) . From 
initial parameters, the weights are updated until the 
conventional cost function 



CR 



1 JL 



(k) 



(3) 



is minimized. The term TV stands for input samples 
used for training and the error term 
e(k) —x d {k) —x(k) . The received data is given 
by r(k) —z(k)-hg(k) . Conventionally the 



minimization of this cost function is performed 
iteratively by derivative based LMS algorithm. 

III. ROBUST ADAPTIVE INVERSE MODEL 

Three robust cost functions (RCF) defined in literature 
[15, 18, 19] are chosen in the development of robust 
adaptive inverse models. The RCFs are defined as 

A. Robust Cost Function- 1 (Wilcoxon Norm) [14,15] 

The Wilcoxon cost function is a pseudo-norm and is 

/ / 

defined as CFj = ^ a(R(v.))v.= ^ a(i)v. 

i= 1 i= 1 

(4) 

where ^(v ; ) denotes the rank of v ; among 

V 1,V 2 , ,v t , v (1) <v (2) < v (/) 

are the ordered values of v x , v 2 , , v, , 

«(0 = <#>'/(/ +1)] . In statistics, different types of 
score functions have been dealt but the commonly used 
one is given by <fi(u) =~JV2(u — 0.5) . 

B. Robust Cost Function-2 [18] 
It is defined as 



CF 5 =cr(l-exp(-e 2 /2o)) 



(5) 

where (J is a parameter to be adjusted during training 

and e 2 is the mean square error 

C. Robust Cost Function-3 (Mean Log Squared error) 
[19] 

The next cost function is defined as 

CF 4 =log(l+^) 

(6) 
where e 2 is mean square error. 

The weight-update mechanism of inverse model of Fig. 
1 is carried out by minimizing the cost functions of the 
errors defined in (4), (5) and (6) using BFO algorithm. 

IV. Development of Robust Nonlinear Inverse 

Models Using Bfo 

The updating of the weights of the BFO based inverse 
model is carried out using the training rule as outlined 
in the following steps: 

Step -1 Initialization of parameters 

(i) S b - No. of bacteria to be used for searching the 

total region 

(ii) iV = Number of input samples 

(iii) P = Number of parameters to be optimized 

(iv) -/V s = Swimming length after which tumbling of 

bacteria is undertaken in a chemotactic loop. 
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(v) N c ^Number of iterations carried out in a 

chemotactic loop, iv c > N s . 

(vi) N re = Maximum number of reproduction loop 

(vii) N ed — Maximum number of elimination and 

dispersal loop. 

(viii) P ed — Probability with which the elimination and 

dispersal takes place. 

(ix) The location of each bacterium P (1- P , 1-S b , 1) 

is specified by random numbers on [0,1]. 

(x) The value of run length unit, C(i) is assumed to be 

constant for all bacteria. 

Step-2 Generation of model input 

(i) Random binary input [1,-1] is applied to the 
channel/plant, 
(ii) The output of the channel is contaminated with 
white Guassian noise of known strength to generate the 
input signal, 
(iii) The binary input is delayed by half of the order of 

the inverse model to obtain the signal, X d (A:) . 

Step -3 Weight update algorithms 

In this step the bacterial population, chemotaxis, 

reproduction, elimination and dispersal are modeled. 

Initially j = k = 1 = 

(i) Elimination dispersal loop 1 = 1+1 

(ii) Reproduction loop k = k + 1 

(iii) Chemotaxis loop 7=7 + 1 

(a) For * = 1,2, 5^, the cost function 

J(i,j,k,l) for each i th bacterium is evaluated by the 

following way : 

(1) TV number of random binary samples are 

generated , passed through the model and the output is 

computed. 

(2)The output is then compared with the corresponding 

training signal, X d (&) to calculate the error, e(&) . 

(3)The robust cost functions of the error terms are 

computed as J(i,j,k,l) . 

(4)End of For Loop. 

(b)For* =1,2, ,5^ the 

tumbling/swimming decision is taken. 

Tumble : A random vector A(z'), with each element, 

A m (i),m = l,2, p, in the range of [-1, 1] is 

generated. The new position of I th bacterium in the 
direction of tumble is computed as 

P i (j+l,k,l)=P i (j,k,l)+C(i)x^J^ (7) 

The updated position is used to compute the new cost 
function (mean squared error) J(i,j + \k,l) . 
Swim - (i) Let c (counter for swim length) = 



(ii) While C < N^ , that is bacteria have not climbed 

down too long then update c = C + 1 

IfJ(J) < J(J ~ 1) then the new position of i th 

bacterium is computed by using (7). The updated 

position, P(j + l,k,l)is used to compute the new cost 

function, J(i, j + 1, k, I) 

ELSE c = N s . (End of the WHILE statement). 

(c)Go to next bacterium (;' + 1) if \ f S the process is 

repeated. 

(d)If min(J) {minimum value of J among all the 
bacteria} is less than the tolerance limit then all loops 
are broken. 

Step-4. If 7 < N c ,go to (iii) to continue chemotaxis 
loop since the lives of bacteria are not over. 

Step-5 Reproduction 

(a) For the given fc and / and for each 

i =1,2, , S h the bacteria are sorted 

in ascending order of cost functions, J (higher cost 
function means lower health). 

(b) The first half of bacteria population is allowed to 
die and the remaining half bacteria with least cost 
functions split into two and the copies are placed at the 
same location as their parent. 

Step-6. If k < N re go to Step-2. In this case, the 
number of specified reproduction steps has not reached 
and the next generation in the chemotactic loop is 
started. 

Step-7. Elimination -Dispersal 

The bacterium with an elimination-dispersal probability 

above a preset value p is eliminated by dispersing 

ed 

to a random location and new replacements are 
randomly made in the search space. By this approach 
the total population is maintained constant. 

V. Development of Robust Inverse Model Using bfo 
Based Training 

Three robust cost functions defined in literature [15, 18, 
19] are used to develop the robust inverse models. The 
BFO is then used to iteratively minimize these cost 
functions of the error obtained from the model. The 
resulting inverse model is expected to be robust against 
outliers. The weight-update of inverse model of Fig. 1 
is carried out by minimizing these cost functions of the 
errors defined in (4), (5) and (6) using BFO algorithm. 
In this approach, the procedure outlined in Steps- 1 to 7 
of Section IV remains the same. The only exception is 
detailed as follows : 

Let the error vector of P th bacterium at k th 
generation due to application of N input samples to 
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the model be represented as 

Ye Xp {k), e 2 p (k}, ,e„ p (k}~\ r . 

The errors are then arranged in an increasing manner 
from which the rank R { & n \ k )\ of each n th error 

term is obtained. The score associated with each rank 
of the error term is evaluated as 
i 



a(i) = yfl2{ 



JV + 1 



—0.5 



(8) 



where ( 1 < i < N ) denotes the rank associated with 
each error term. At k th generation of each p th 
particle the Wilcoxon norm is then calculated as 

N 

C (*)=2>(»)e (k) (9) 

(■ = 1 
Similarly other two robust CFs are also computed using 
(5) and (6). The learning process continues until the CF 
decreases to the possible minimum values. At this stage 
the training process is terminated and the resulting 
weight vector represents the final weights of the inverse 
model 

VI. Simulation Study 

In this section, the simulation study of the proposed 
inverse model in presence of 10% to 50% of outliers in 
the training signal is carried out. Fig. 1 is simulated for 
various nonlinear channels using the algorithm given in 
Sections V and VI. The transfer functions of three 
standard linear systems used in the simulation study are 

H x {z): 0.209 + 0. 995 z~ 1 +0.209z~ 2 
H 2 {z):0. 260 + 0. 930z~ ! +0. 260 z~ 2 (10) 
H 3 {z):0. 304 + 0. 903 z _1 +0.304z~ 2 

The eigen- value-ratio (EVR) of the plants given in (10) 
are 6.08, 11.12 and 21.71 respectively [8] which 
indicate the complexity of the plant or channel. A 
higher EVR indicates a bad channel in the sense that 
the convergence of the model becomes poor. To study 
the effect of nonlinearity on the inverse model 
performance, two different nonlinearities are 
introduced 

NLl:z{k)=tanh(y{k)) 

NL2:z{k)=y(k) + 0.2y 2 (k)-0.1y 3 (k) 



(11) 



where y ( k ) is the output of each of linear systems 
defined in (10). The additive noise in the channel is 
white Gaussian with -30dB strength. In this study an 8- 
tap adaptive FIR filter is used as an inverse model. The 
desired signal is generated by delaying the input binary 
sequence by half of the order of the inverse model. 
Outliers are added by simply replacing the bit value 
from 1 to -1 or -1 to 1 at randomly selected locations 
(10% to 50%) of the desired signal. In this simulation 
study, we have used the following parameters of BFO : 



S h =8, N is =100, p =8, N s =3, N c =5, N re 

=40-60, A^ rf =10, P ed =0.25, C(i)= 0.0075. 
This selection of parameters is based on achieving best 
inverse modeling performance through simulation. 
The bit-error-ratio (BER) plot of BFO trained inverse 
model pertaining to different nonlinear plants with 
different cost functions in presence of 0%-50% of 
outliers are obtained through simulation. The BER 
plots with 50% outliers only are shown in the Figs.2(a)- 
(f). In these figures the BER plots of four cost functions 
have been compared. To study the effects of EVR of 
the plant on BER performance the SNR is set at 15dB 
in presence of 0% and 50% outliers in the training 
signal and the results are shown in Fig. 3 for 50% 
outliers. Few notable observations obtained from these 
plots are : 

(a)Keeping the CF, SNR and percentage of outliers in 
the training signal same, the BER increases with 
increase in the EVR of the channel. Similarly under 
identical simulation conditions the squared error cost 
function based inverse model performs the worst where 
as the Wilcoxon norm based model provides the best 
performance (least BER). 

(b)As the outliers in the training signal increases the 
Wilcoxon norm based inverse model continues to yield 
lowest BER compared to those provided by other 
norms. 

(c)With no outlier present in the training signal, the 
BER plot of all four CFs are almost same 
(d)In presence of high outliers the conventional CF2 
based model performs the worst followed by CF4 based 
model. In all cases the Wilcoxon norm (CF1) based 
inverse model performs the best and hence is more 
robust against low to high outliers in the training signal. 
(e)The accuracy of inverse model based on CF3 and 
CF4 norms developed using outliers is almost identical. 
(f)In addition, the plots of Fig. 3 indicate that at 50% 
outliers in the training signal the BER increases with 
increase in the EVR of the nonlinear plants, 
(g) Further, the BER of the inverse models of all plants 
and SNR conditions is highest in square error norm 
(CF2) based training compared to three other robust 
norms used. However the Wilcoxon norm (CF1) based 
inverse model yields minimum BER among all cases 
studied. 




Fig. 2(a) Comparison of BER of four different CFs based nonlinear 

equalizers with [.209, .995, .209] as channel coefficients andNLl 

with 50% outliers 
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\ V 



Fig. 2(b) Comparison of BER of four different CFs based nonlinear 

equalizers with [.209, .995, .209] as channel coefficients and NL2 

with 50% outliers 




Fig. 2(c) Comparison of BER of four different CFs based nonlinear 

equalizers with [.260, .930, .260] as channel coefficients and NL1 

with 50% outliers 




Fig. 2(d) Comparison of BER of four different CFs based nonlinear 

equalizers with [.260, .930, .260] as channel coefficients and NL2 

with 50% outliers 
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Fig. 2(e) Comparison of BER of four different CFs based nonlinear 

equalizers with [.304, .903, .304] as channel coefficients and NL1 

with 50% outliers 




Fig. 2(f) Comparison of BER of four different CFs based nonlinear 

equalizers with [.304, .903, .304] as channel coefficients and NL2 

with 50% outliers 




(b) NL2 
Fig. 3 Effect of EVR on the BER performance of the four 
CF-based equalizers in presence of 50% outliers 

Conclusion 

This paper examines and evaluates the learning 
capability of different robust norms of error when the 
training signal (of the inverse model) is contaminated 
with strong outliers. To facilitate such evaluation 
different nonlinear plants with varying EVRs are used. 
The population based BFO learning tool is developed 
to minimize four different norms. The robustness of 
these norms is assessed through extensive simulation 
study. It is in general observed that the conventional 
squared error norm (CF2) is least robust to develop 
inverse models of nonlinear systems under varying 
noise conditions whereas the Wilcoxon norm (CF1) is 
the most robust one. In terms of quality of 
performance, the norms are grouped in the order CF1, 
CF3, CF4 and CF2. 
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